MML is not consistent for Neyman-Scott

نویسنده

  • Michael Brand
چکیده

Minimum Message Length (MML) is a popular method for statistical inference, belonging to the Minimum Description Length (MDL) family. It is a general name for any of several computationally-feasible approximations to the generally NP-Hard Strict Minimum Message Length (SMML) estimator. One often-cited showcase for the power of MML is the Neyman-Scott estimation problem, where most popular estimation algorithms fail to produce a consistent result. MML’s performance on the Neyman-Scott problem was analysed by Dowe and Wallace (1997) and by Wallace (2005) and MML was shown to be consistent for the problem. However, this analysis was not performed on SMML, but rather on two SMML approximations: Wallace-Freeman and Ideal Group. As for most estimation problems of more than a single dimension, the exact SMML solution is not known for Neyman-Scott. Our contributions: 1. We analyse the Dowe-Wallace solution, and show that it hinges critically on the use of an unnatural prior for the problem. We argue that the Jeffreys prior is a more natural prior to assume in this case. 2. Re-analysing the problem over its Jeffreys prior, we show that both the Ideal Group and the Wallace-Freeman approximations converge to the (inconsistent) Maximum Likelihood (ML) solution. The Wallace-Freeman MML approximation is perhaps the most frequently used MML flavour. 3. We develop novel techniques that enable determining properties of the SMML estimator for some general families of estimation problems without requiring a full construction of the estimator. 4. Using these tools, we show that the SMML estimator for many problems, including Neyman-Scott, is not a point-estimator at all. Rather, it maps each observation to an entire continuum of estimates. 5. We show that for every set of Neyman-Scott parameters, as well as in the asymptotic cases, the SMML estimate for each observation includes the Maximum Likelihood estimate. 6. In particular, we prove in this way that SMML is not consistent for this problem. The importance of this is that it demonstrates that the inconsistency is not only in Email address: [email protected] (Michael Brand) Preprint submitted to TBD October 17, 2016 the Ideal Group or any other MML approximation, but rather appears already in the strict form of MML, and therefore reflects properties of the MML framework itself. 7. We uncover some hidden connections between SMML, Wallace-Freeman MML and ML, and describe families of estimation problems for which all three coincide. 8. We discuss methodological problems in the arguments put forward by previous authors, who argued that MML is consistent, and specifically that it is consistent for the Neyman-Scott problem.

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

An estimating function approach to inference for inhomogeneous Neyman-Scott processes.

This article is concerned with inference for a certain class of inhomogeneous Neyman-Scott point processes depending on spatial covariates. Regression parameter estimates obtained from a simple estimating function are shown to be asymptotically normal when the "mother" intensity for the Neyman-Scott process tends to infinity. Clustering parameter estimates are obtained using minimum contrast es...

متن کامل

A Generalization of the Neyman-Scott Process

In this paper we introduce a generalization of the Neyman-Scott process (Neyman and Scott, 1958) that allows for regularity in the parent process. In particular, we consider the special case where the parent process is a Strauss process with offspring points dispersed about the parent points. Such a generalization allows for point realizations that show a mix of regularity and clustering in the...

متن کامل

Why Simplicity Is No Problem for Bayesians

The advent of formal definitions of the simplicity of a theory has important implications for model selection. But what is the best way to define simplicity? Forster and Sober ([1994]) advocate the use of Akaike’s Information Criterion (AIC), a non-Bayesian formalisation of the notion of simplicity. This forms an important part of their wider attack on Bayesianism in the philosophy of science. ...

متن کامل

Asymptotic Palm likelihood theory for stationary point processes

In the present paper, we propose a Palm likelihood approach as a general estimating principle for stationary point processes in Rd for which the density of the second-order factorial moment measure is available in closed form or in an integral representation. Examples of such point processes include the Neyman–Scott processes and the log Gaussian Cox processes. The computations involved in dete...

متن کامل

Synthetic plots: some history and examples

Jerzy Neyman and Elizabeth Scott developed the idea of synthetic plots. These plots are a display of the data values of an experiment side by side with a display of simulated data values, with the simulation-based on a considered stochastic model. The Neyman and Scott work concerned the distribution of galaxies on the celestial sphere. A review of their wo is presented here followed by personal...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

عنوان ژورنال:
  • CoRR

دوره abs/1610.04336  شماره 

صفحات  -

تاریخ انتشار 2016